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There has been a considerable effort to understand and quantify the spatial distribution of species across 
different ecosystems. Relative species abundance (RSA), beta diversity and species area relationship (SAR) are 
among the most used macroecological measures to characterize plants communities in forests. In this article we 
introduce a simple phenomenological model based on Poisson cluster processes which allows us to exactly link 
RSA and beta diversity to SAR. The framework is spatially explicit and accounts for the spatial aggregation of 
conspecific individuals. Under the simplifying assumption of neutral theory, we derive an analytical expression 
for the SAR which reproduces tri-phasic behavior as sample area increases from local to continental scales, 
explaining how the tri-phasic behavior can be understood in terms of simple geometric arguments. We also find 
an expression for the endemic area relationship (EAR) and for the scaling of the RSA. 

I. INTRODUCTION 

The relation between the mean number of different species observed within a given sampled area, i.e. the Species-Area 
relationship (SAR), is one of the most studied patterns in ecology and represents one of the simplest ways to characterize the 
biodiversity of a region. There is a considerable body of research fl]-[6] showing that the curve of the SAR is a non-decreasing 
function whose slope depends on the sampled area and has a characteristic shape in a log-log plot (see Figure [TJ. This "tri- 
phasic" curve is relatively steeper at local and continental scales, but shallower at intermediate scales. This latter regime is 
typically described by a power-law S ~ A z , even though there is no compelling theoretical reason to choose such a function. A 
wide range of models has been suggested in recent years to account for this shape: some of them are based on geometrical Q 
or statistical considerations IH]|9), while others show that there are biological traits which can affect the shape of the SAR 1 10 1. 

Simplified theoretical frameworks of population dynamics, such as the neutral theory of biodiversity [11], have made consid- 
erable progress in predicting several patterns at different spatial and temporal scales [12-18 1, including the SAR |[T9l . Despite 
the simplicity of its core assumption, i.e. all individuals within a trophic level have the same probabilities to die or survive 
irrespective of the species which they belong to, the framework has provided a baseline expectation for a variety of patterns akin 
to those observed in empirical data. Thus, on the one hand it represents a powerful tool to investigate a series of underpinning 
mechanisms at the core of universal ecological behaviors; on the other, it questions more complex explanations for empirical 
patterns. The original formulation [ 1 1 1 and the majority of neutral models suggested later on have dealt with spatial features only 
implicitly or the predictions are obtained via scaling relations [20| . Within such approaches the dispersal abilities of species 
are captured only approximately, although in an analytical tractable way. Spatially explicit models represent a substantial step 
towards a more realistic study of ecosystems, but present much greater theoretical challenges with respect to their implicit coun- 
terparts. In fact, spatial ecological measures such as the species area relationship crucially depend on the behavior of multiple 
points correlation functions, and any truncation would inevitably impair the predictions. As a consequence, one needs to solve 
the model in full generality, a task that is highly non trivial because stochastic theories defined on space often have stationary 
states for which detailed balance does not hold. This condition ensures that at stationarity the probability to go from one config- 
uration to another one is the same as the reversed transition ||2T1|221 . Recently, O'Dwyer & Green l23l have derived the SAR 
from a fully spatially explicit model by using field theoretical techniques. However, their findings were implicitly obtained under 
the assumption that the Detailed Balance is satisfied ll24l . a condition that is not correct for their model. 

Within a neutral setting, we introduce a simple mechanism which is able to produce a tri-phasic SAR and can be explained in 
simple geometrical terms. The model is based on the Poisson Cluster Processes E51 I26I and allows us to derive the SAR, the 
endemic area relationship and also the spatial scaling of the RSA. 

II. EMERGENT GEOMETRY OF POISSON CLUSTER PROCESSES 

Poisson Cluster Processes and Neyman-Scott processes [25-29 1 (PCP in the following) are a very general framework useful 
to analyze spatial ecological data and characterize population aggregation 1 30-32]. These processes are quite simple and based 
on the assumption that individuals are spatially clumped in clusters. Specifically, the centers of clusters are distributed in space 
with a constant density independent of each other. Each cluster is populated by a random number of individuals (drawn from a 
given distribution) and the distance of each individual from the center of the cluster is drawn from a given distribution (typically 
a Gaussian distribution with a certain variance £ 2 ). 
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FIG. 1: Qualitative shape of the Species-Area relationship [11]. On local spatial scales (region A) the trend is steep. On 
intermediate spatial scales (region B) the slope decreases and the curve is well approximated by a power law with exponent z. 
Finally, on very large spatial scales (region C), the linear size of sampled areas is much greater than the correlation length of 
biogeographic processes, so that the majority of species are completely independent of each other. 



We consider a simplified version of the PCP in a homogeneous landscape of area A§, assuming that: 

1 . Species are independent of each other. 

2. The individuals of any species are distributed around a single center whose location is uniformly drawn within the land- 
scape. 

3. The position of individuals with respect to the center is drawn from a given distribution <fi(r), where r is the position with 
respect to the center. The distribution has a characteristic scale £ above which it decreases exponentially. 

4. The number of individuals per species are drawn from a given Relative Species Abundance (RSA) distribution Sk(Ao). 

The assumption in item 3 takes into account that individuals belonging to the same species are usually spatially aggregated 
(Plotkin 2002) - we use a single cluster center (item 2) for the sake of simplicity. Here we do not focus on the biological 
mechanisms underlying conspecific spatial aggregation, but we account for it in a phenomenological fashion. Because we 
assume that species are independent (item 1) and every species is characterized by the same model parameters, the model is 
non-interacting and neutral as well. 

The model is formulated as neutral, i.e. every species behaves in the same way. If neutrality holds and under the assumption 
of species independence, we can consider simply one species at a time to calculate every quantity. Within this model we 
can explicitly calculate the SAR for a homogeneous and large landscape. Under these hypotheses we obtain the species-area 
relationship simply from the probability of finding at least one individual in a given sub-region of area A by [23 1: 

oo 

S(A\A ) = Stot(A ) Pk(A\A ) = S tot (A ) [l - P (A\A )] , (1) 

fc=i 

where S to t(Ao) is the total number of available species in the whole system with area A Q (i.e. J^kLo ^k(A-o)), while P^(A\A ) 
is the probability of finding exactly k individuals of a given species in the sub-region of area A and has the following expression 



(see Appendix A i 



Pl( A\M) = r*w4 / ft!^>L^«*i , (2) 

Jo A Q JAo k - 
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where A(r) is a region of area A centered at the point r. The distribution p(X) is strictly related to the RSA, Sk(Ao), implicitly 
defined by the following equation 



S k (A ) = S tot (Ao) I dXp(X) — — — . (3) 







Interestingly, the equation |5]reduces to the random placement model 11331 in its mean field version, i.e. by considering 4>{r) to 



be constant. On the other hand, it is possible to relate the quantity <j)(r) to the two point correlation function (see Appendix B i 
The correlation function is proportional to the /3-diversity (which is a well known measurable quantity in real systems |34|) 
Specifically, we obtain the following relation 

G 2 (r) = (A 2 ) f <Py<f>(y)<Kv-r), (4) 

JAo 

where (A 2 ) = J °° d\X 2 p(X). Thus we can directly obtain an expression for </>(r) when the correlation function. By applying 

the Fourier transform, it is possible to invert equation j^J obtaining tfi(p) oc \J Gi (p) (where tfi(p) is the Fourier transform of 
(r), see Appendix B i. The formula in Eq. [4] with an appropriate choice of <f>(r), has the same structure obtained with different 



models 11151 1341 . 



III. RESULTS 
A. Species Area Relationship 



The final expression of the Species-Area Relationship is obtained by substituting equation [2] into equation [T] and taking the 
limit Aq — > oo in the spirit of 1 19 1. We find (see Appendix C i that the average number of species found within a sampled area 



is equal to 

S(A) = s tot I d\\l - I dXp(X)e- X ^r_ ) d 2 r'Hr') 



(5) 

The quantity s tot is obtained from the limit lim^^oo Stot {Ao)/Aq and it has the interpretation of an effective density of species. 
In | Appendix C| we show that this quantity is well defined, i.e. the limit does exist and is different from zero. Note that in this 
way we have obtained an analytic expression for the SAR if we are given the RSA and the pair correlation function. The model 
generates the spatial aggregation of individuals in a very simple way and without resorting to any explicit biological mechanism 
(see Figure E}, and therefore the emergent spatial distribution could potentially describe spatial features of species with very 
different traits, dispersal abilities or habitat preferences. Thus, the SAR in equation [5] is more the result of basic geometrical 
features than the effect of underlying biological mechanisms. This consideration is important especially when one tries to infer 
the effects of fundamental mechanisms simply by comparing empirical data to analytical curves obtained from more complex 
models. 

We can extract some general information from equation [5] independent of the specific form of the RSA and the correlation 
function. . Because £ is a correlation length and characterizes the spatial scale over which a species is distributed, from 



dimensional analysis (see Appendix Di we have that S(A) = stotAf(A/£ i 2 ). We can study the SAR for small and large areas 
(which can be obtained as an expansion for i>( 2 and A <C £ 2 ). The small area expansion gives, regardless of the choice of 
the RSA or the spatial distribution, the following result (see | Appendix E| > 

S(A) ~ (p)A = N(A) , (6) 

where is the density of individuals and N(A) is the number of individuals in the area A. This is an expected result: when 
we sample small areas, the majority of sampled individuals belong to different species (and thus the number of species grows 
linearly with the number of individuals). This result (S ~ N) is valid for all areas when </>(r) = cost and corresponds to the 
Random Placement model 11331 in the limit of large Aq. For large areas (i.e. areas much larger than the one given by the typical 
correlation scale), we obtain 

S(A) ~ stot (l - J dAp(A)e- A ) = sA , (7) 

where s is defined as the average density of observable species (i.e. with at least one individual in the whole landscape). At large 
spatial scales the mean number of species grows linearly with the sampled area and the spatial aggregation of individuals is no 
longer important, only the total density of species s matters. 
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Now we focus on specific forms of the RSA and the correlation function in one idealized example. We consider the Fisher log- 
series for the RSA lfTTIl35l (i.e. S k = 0x k /k) and the Bessel function A'o for the correlation function |34|. The corresponding 
choice for p(\) is an appropriate limit of the Gamma distribution (curiously the Fisher log-series was firstly introduced by 
Fisher l35l exactly via p(A)), while for </>(r) we obtain (see Appendix~F| i 



(r) = CxpHldl/e) (8) 



By substituting this expression in equation|5]we obtain 

l-x(l-I(A,r)) 



S{A) = 6 J dr log 



1 - x 



(9) 



where I{A, r) = J A ^ dr?cj)(r'). In general, the integral in Eqj9jdoes not have a closed form, however it can be easily evaluated 
numerically and the result is shown in Figure|3]for </>(r) given in equation[8] 

The SAR shows a linear growth at small as well as large scales as predicted by the general consideration above and an 
approximate power-law at intermediate regions. The scale between the power-law trend and the large area linear growth is 
totally determined by the shape and characteristic scale of the correlation function (i.e. of the /3-diversity). For example, if we 
consider <p(r) equal to zero outside a circular region with radius £ and constant inside, th e scale will b e equal to A 2 = 7r£ 2 . In 
our case, where we have used Kq as the correlation function, we obtain A 2 — £ 2 167r (see Appendix G I. In Figure [5] we plot the 



result in units of this area, showing that the scale does not depend on x. The scale A\ between the rapid growth at small scales 
and the power-law behavior depends on the RSA through the parameter x. We observe a rapid growth at small areas because we 
are sampling individuals of different species, this trend starts to bend when we collect more individuals of the same species. This 
happens at a scale equal to the typical distance between conspecific individuals, i.e. the scale A\ is the average area occupied by 



one individual of a given species. In Appendix G we calculate this scale to be 



A 1 = h(x)A 2 = (1 - x) X l0g 2 (1 X) A 2 . (10) 

X s 

In Figure[3]this quantity is plotted for the SAR with different values of x. We find that the SAR shows a linear trend with a slope 
equal to density of individuals (p) for areas A < A\ — h(x)A2, a power-law trend S ~ A z for A\ < A < A 2 and a linear 
growth at scales A > A 2 where the proportionality constant is equal to the average density of species s. 

At intermediate scales the derivative d log S(A) jd log A varies slowly, so the behavior of the SAR can be well approximated 
by a power-law if the exponent z is defined as the slope at the inflection point, i.e. the minimum of dlog S(A)/d\og A. We 
show the result in the right panel of Figure[3] The exponent z, in this version of the model, depends only on the parameter x and 
ranges between 0.15 and 0.4, which is the range of observed values see e.g. ifTTI . The parameter x is the parameter of the Fisher 
log-series, which is assumed to be the RSA of the entire system. By using the relations which relate the speciation rate and the 
density of individuals iTPTl . we obtain that, for reasonable values of the speciation rate v,\ — x ~ si>/(p\. The model predicts a 
value of the exponent x between 0.15 and 0.4 for reasonable values of 1 — x between 10~ 3 and 10~ 9 [34|. 

The model allows one to calculate not only the SAR, but also the probability to find k species within an area A. Under the 
hypotheses of neutrality and of the absence of interactions, the probability 1 — _Po(^4|^-o) to find a given species in a certain 
sub-region of area A is independent on the other species. Due to the absence of interactions, the joint probabilities to find a given 
set of species factorize and then the probability to find k species in a sub-region of area A will be a Binomial distribution 

P k s (A\A ) = ( Stot l Ao) ) (1 - Po(A\A )) k (P (A\A )) S(Ao) - k . (11) 

In the limit of Aq — > oo, Stot(Ao) tends to infinite while 1 — Po(^-I^o) tends to 1 with a finite product and thus the distribution 
in the large Aq limit turns to be a Poisson distribution with average S(A) 

P k s (A) = ^^eMS(A)). (12) 

Therefore in the large total area limit the probability to find k species in a sub-region of area A is a Poisson distribution with 
average S(A). This is a prediction that could be simply tested with empirical data. The result is not specific of our model but 
is generally valid under the non-interacting assumption. Thus this represents an interesting and practical way to measure the 
macroscopic effect of the interactions between species at different scales. 



5 



Log Area 




FIG. 2: This figure shows the mechanism which produces the tri-phasic SAR as explained by the model. Different colors 
indicate different species. At large spatial scales the sampled areas are larger than the typical one occupied by a given species: 
this produces the linear scaling observed at large areas. When we observe the system at intermediate scales, the distribution of 

individuals follows a non-trivial spatial organization which corresponds to power-law-like behavior. Instead, at very small 
scales, on average every individual belongs to a different species, thus making the scaling with the area linear. This shows that 
the tri-phasic SAR can be understood in terms of very general geometric considerations. The figure at large scales is obtained, 
for graphical reasons, in a regime of relative small s, which introduces strong fluctuations in the density of individuals. The 

density of individuals is constant for reasonable values of s. 
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FIG. 3: The Species-Area relationship and the exponent z. The left panel shows the SAR obtained for different values of x 
(solid lines) where we have set 6 = 1, This choice is justified because the qualitative behavior of S(A) and the exponent z are 
independent of 8. The area is measured in units of A^, the area at which the linear behavior of the SAR sets in. The dashed 
black segments represent the two scales A\ and A2 obtained in equation [10] which separate the different regimes. The right 
panel shows the values of the exponent z (obtained at the inflection point) for different values of x. It spans the observed values 

for reasonable values of x. 
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B. Endemic Area Relationship 

While the SAR is defined as the average number of species in an area A, the Endemic Area Relationship (EAR) ll36l is the 
average number of species whose individuals are completely contained in an area A. This quantity has a fundamental importance 
in ecology, because gives an estimation of the number of immediate extinction due to a loss of space (further extinctions might 
take later). Within our framework we can obtain an expression EAR and its relation with the SAR. 

The general formula for the EAR can be obtained by calculating the number of species with zero abundance outside a sub- 



region of area A (see Appendix H I 



E(A) = Stot / d 2 z / d\p(\)e- x [e Xl M^ d - l] . (13) 



This expression depends on the distributions p(X) and <j>(r) which are related to the RSA of the entire system and to the f3- 
diversity. The EAR corresponding to the case analyzed for the SAR is 

E(A) = -9 J dz\og(l - xI(A,z)j . (14) 

This expression is compared to the scaling of the SAR (see equation|9]l in Figure|4] Interestingly, the EAR seems to be linear up 
to the correlation length. The EAR becomes quite similar to the SAR for length scales larger than the correlation length, because 
we are considering areas much larger than the typical space occupied by a species. In Figure [4] we observe that the EAR shows 
a linear trend at small scales, by expanding equation [14] we obtain 

E{A)~QxA, (15) 

if A <C Ai. This approximation is equivalent to the result of the random placement [33 1. Note that the trend of the EAR depends 
weakly on the value of x when it assumes the empirical values which are typically close to 1. On the contrary its depends on 
the biodiversity parameter 0. This approximation, as shown in Figure |4j is valid, for values of x close to 1, also at the scales at 
which the SAR shows the power-law trend (which are the most interesting scales from theoretical point of view based on the 
experience gained in statistical mechanics of continuum transition characterized by power-law behavior and universality). 
We can also calculate, as done for the SAR, the probability distribution of EAR, defined as the probability that in an area 



A there are k endemic species. By using the same arguments used for SAR we demonstrate in the Appendix H that the EAR 
follows a Poisson distribution with average E(A). It is interesting to study the probability to find at least one endemic species 
Pf AR (A), which is distributed as 

P e EAR (A) = 1 - Pq AR (A) = 1 - er E ^ , (16) 

where P f EAR (A) is the probability that any endemic species is found in an area A. The plot of this quantity is shown in Figure|4] 
We observe that this probability has a non trivial scaling with a rapid increase at the scale at which E(A) approaches to one. 
Therefore it exists a typical scale over which we observe endemic species. Our framework allows to determine this scale. As 
shown in Figure [4]the scaling of the probability is well approximated, at the interesting scales, by substituting the expansion of 



the EAR at small scales. By using the expansion of equation 15 we can calculate the typical area A c at which the probability of 



equation 16 becomes equal to 1/2 



log(2) 

= HIe ' (17) 



This expression is valid only if A c <C A 2 , because it follows from the expansion of equation 15 But this is the interesting case, 
because, for the typical ecological application (e.g. to have an estimation of the extinction debt), it is important to know the EAR 
behavior at small scales. Note in Figure [4] that, due to the shape of the EAR, the linear approximation for the EAR is a lower 



bound of its real value (i.e. the expression of equation 15 which is a good approximation at small scales, is always lower than 



the real value at bigger scales). This fact simply implies that the real value of the area A c (the area at which the probability to 



find an endemic species is equal to 1/2) is always lower than the value of equation 17 which is a good approximation at small 
scales. 



C. Relative Species Abundance 



In this section we obtain an expression for the RSA restricted to a given sub-region. This quantity is defined as the number of 
species Sk with a certain individual abundance k. To obtain an expression for the SAR and the EAR, we have postulated a form 
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FIG. 4: Endemic -Area Relationship. In the Figure A we compare the SAR (green curve) with the EAR (red curve), obtained 
respectively from equation [9] and 14 The curve are plotted for x = 1 — 10~ 7 and 9 = 1. In Figure B we show the probability to 
find at least one endemic species (see equation [To). The black curve is obtained by integrating equationfTJ] while the turquoise 



dotted curve is obtained with the approximation of the equation 15 The figure inside is a comparison between the EAR and the 



approximation at small scales of equation 15 The area unit are the same as in figure|3] 



of the RSA in the whole landscape Sk(Ao). Starting from this input, within our framework, we can obtain an expression for the 
RS A Sk (A) in a sub-region of area A. 



The general formula for the RSA restricted to a sub-region is obtained in Appendix A and, in the limit of large Aq, turns to be 



8 k (A) = stat I d 2 z J d\p(X) 



■ exp( - A / d 2 r(j)(r) 



(18) 



Note that this expression is consistent with our expression for the SAR: by summing over k we obtain equation [5] As done for 
the SAR and the EAR, we study the case in which the RSA of the entire system is a Fisher log series, obtaining 



S k (A) = 



9 



d 2 z 



(r 



xI(A,z) 



x(l-I(A,z)) 



(19) 



In order to compare different length scales we do not use directly the RSA, which depends extensively on the observed 
area, instead we study the behavior of the Normalized Relative Species Abundance (NRSA). This quantity is defined as the 
probability p^ RSA that an observed species has a certain number of individuals and it could be obtained by normalizing 
the RSA to one (i.e. by dividing it with the SAR). In our case the NRSA of the entire system is a Fisher log-series, i.e. 



pNRSA 



by dividing it with the SAR). In our case the NRSA of the entire system is a Fisher log-series 
(Aq) = x k /(— k log(l — x)). In Figure|5]is shown the behavior of this probability at different length scale. We observe 
different behaviors at different scales. Very interestingly if we measure the parameter x via the large k behavior of Pg RSA (A), 
we find an effective parameter x e ff(A) which depends on the area observed A and is equal to x in the limit A ^> A^. Notably 
this effective x decreases with the observed area, as observed in empirical systems. 



IV. DISCUSSION 



In this article we have introduced a simplified version of the Poisson Cluster Processes apt to describe large homogeneous 
landscapes. This is not a microscopically based process, but describes the aggregation of individuals starting from simple 
phenomenological considerations. Within this framework, we have shown how one can relate the SAR to the beta-diversity 04l 
and to the RSA under simple and general assumptions. Secondly, we have obtained the tri-phasic SAR and identified how 
the exponent of the approximated power-law depends on the parameters of the RSA (e.g. the demographical parameter or the 
speciation rate). Finally we have obtained a formula for the EAR and an expression for the RSA at different scales. 

In order to disentangle different sources of information within species-area patterns, we first need to understand how general 
the assumptions are that can generate the observed patterns. If the qualitative shape of a curve can be captured by simple 
mathematical considerations, then it seems likely that ecological aspects drive finer and more quantitative details of the curve, 
although alternative explanations may hold as well. Our work shows that the tri-phasic shape of the SAR is a very general 
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pN RSA (A) is defined for k > and is obtained as the ratio between the RS A of equation 
left figure is for a value of x equal to 1 — 10~ 3 , while the right one is obtained for x = 1 
NRSA at largest scale (which is in our model a Fisher log-series), while the black continuous lines are the NRSA at different 
scales. As the area decrease the slope of the curve (which is directly related to x) decreases. Notably this effective x decreases 
with the observed area, as observed in empirical systems, since the smaller the system the smaller is the average population per 
species and so a rapid decreases of the RSA at large population, i.e. at smaller value of x 



pattern that emerges under simple and general geometrical considerations. Specifically, sampling individuals on local scales 
and the spatial aggregation of conspecific individuals on larger yet finite scales (note that this defines the characteristic length 
scale for /3-diversity) produce the two bending points in the SAR, eventually making the curve tri-phasic. Accordingly, the 
pattern is rather qualitatively insensitive to the implementation of specific ecological mechanisms, and thus it is not surprising 
that models based on very different hypotheses lfl9l [37l can account for tri-phasic SARs. Within this context it is possible to 
relate the prediction of the SAR with the form of the /3-diversity. The effects of inter- and intra-specific interaction, spatial 
heterogeneity and species' traits are important when dealing with the fine details of the curve and should be taken into account 
when a precise prediction is necessary. These mechanisms could influence, in a non trivial way, the final SAR curve and the 
value of the exponent z. 

We have shown that the exponent z (measured as the inflection point of the SAR curve) depends on the demographic parameter 
of the RSA distribution. Although the measured values of z reflect a more complicated dynamics which produces complex spatial 
patterns at intermediate scales, we find, that for realistic values of the demographical parameter x, the exponent z spans the 
empirically observed values. We have also obtained a way to infer the typical scales at which the power-law trend is observable. 
It is well known that the measure of the exponent depends on the scale we observe it ifTOl . However, we have shown that the 
range of scales where the measure of the exponent z is relatively more reliable is directly linked to the correlation length. 

In this work our results are expressed in terms of the demographical parameter x, the parameter characterizing the RSA at 
largest scale, i.e. the Fisher log-series. As shown in section |HI C\ x is not the demographic parameter of the system at every scale. 
When we observe the RSA at a smaller spatial scale, we obtain a different distribution. However an effective demographical 
parameter can be defined at each spatial scale in terms of the decay of the RSA tail at the same spatial scale. We get that this 
effective demographical parameter is an increasing function of the area as is also empirically observed. 

The assumption of non-interacting species within the same trophic level makes it possible to calculate from the SAR the 
probability to find a given number of different species in a certain area. We found that this probability is a Poisson distribution 
(in the limit of a large landscape A$, while it turns out to be a Binomial distribution when a finite A$ is considered). This quantity 
could be measured in available datasets and it represents a powerful way to identify the spatial scales over which neutrality is a 
good approximation and at what scales the interaction becomes macroscopically observable. 

Within our framework it is also possible to calculate an analytical expression for the endemic-area relationship (the number 
of species which are completely contained in a certain area) and its distribution. Interestingly, the EAR scales linearly at small 
scales. We obtained the linear scaling as expansion for small areas which is equivalent to the random placement model l33ll . In 
a recent work ll36l is shown that the random placement describes with a very good approximation the behavior of the EAR in 
different data set. These data set refers to systems with a finite total area A , therefore our model (which is valid for A <C A ) 
is not a good candidate to describe those systems. On the other hand, our framework is able to give an explanation of why the 
random placement works in a good way to describe the EAR, but is not able to reproduce the trend of the SAR. In fact we have 
shown that the the random placement is a good approximation for the EAR at scales lower than A2 (the typical space occupied 
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by a species), whereas it describes the trend of SAR only below the scale A-y (the typical area occupied by a single individual per 
species). Our model provide also an expression for the distribution of the EAR, allowing to calculate the typical area at which 
there is a non negligible probability to find an endemic species. 

The model could also be used to test and to compare the validity of the predictions obtained via scaling relations. For instance, 
it is possible to show that the scaling relations, which describe the behavior at local scales well EUll . are also valid for our model 
in the limit of small areas (where the random placement is recovered). This is not true for larger areas, where it could be 
interesting to study the appearance of new simple relations between the observed quantities. 

The model we propose can be extended in several different ways. Firstly, it would be useful to study how the SAR curve varies 
according to different sampling methods. It is known that the measured SAR depends on the sampling scheme (e.g., nested vs. 
independent) ifTOl and on the geometry of the sampled area [38]. It is not trivial to understand whether these differences are, in 
principle, simply quantified by geometrical considerations or whether they hide some biologically relevant aspects. Finally, it 
would be interesting to introduce non-neutral characteristics and inter-species interaction. 

The model we have introduced does not follow from any intrinsic dynamics but it captures, in a simplified and meaningful 
way, many different processes acting on different spatial and temporal scales. We have shown that regardless of any specific 
dynamics, the patterns observed in empirical studies, especially at large spatial scales, can be explained on the basis of quite 
general and simple processes. It would be interesting to incorporate simple dynamics into the model to assess how the spatial 
patterns are affected. 

We have proposed an analytically solvable model based on minimal assumptions. It allows us to calculate explicitly the SAR 
on an infinite landscape, and also the EAR and the scaling of the RSA. Although this approach neglects important characteristics 
of ecosystems, it allows us to understand the necessary (geometrical or biological) mechanisms at the core of the observed 
macroecological patterns and therefore to quantify the relative importance of the neglected effects. 
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Appendix A. CALCULATION OF P k (A\A ) 



In this section we want to calculate the probability to find exactly k individuals of a given species in a sample area A. This 
quantity is directly related to the SAR. We sketch this calculation starting from the hypotheses written in the main text. As 
explained before, the model we propose is a simplified version of the Poisson Cluster Processes [25 26] to which we refer for a 
more extensive and rigorous discussion. 

The model is neutral and non-interacting. This assumption makes possible to obtain an analytical expression for the SAR, 
because it implies that we can consider one species at a time. 

A simple and intuitive way to perform this calculation is to consider discrete space, write the probability we are interest for, 
and calculate the final result in the continuum limit. In order to distinguish the quantities defined in the continuum and on a 
lattice, we indicate a quantity with a T when it is considered in discrete space. 

Consider a homogeneous and isotropic lattice A with periodic boundary conditions. A site of this lattice is identified by 
a vector r. We assume that a single site could be empty or occupied by a single individual. We know from item [2] of our 
assumptions that the individuals of a species are distributed in a single cluster centered in a point of space x. We define p~i(r\x) 
as the probability that we find an individual in a point r given x to be the position of the center of the cluster, whereas the 
probability that we find the site r empty will be simply p~o(r\x) — 1 — p~i(r\x). 

Consider a set of sites A z — {r 1; . . . ,r|A-|} which has a cardinality \A Z \. We identify this set by labeling it with a point of 
the lattice z. We can calculate, by using the quantities we have just introduced, the probability to find k sites occupied and the 
others empty. It becomes 
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E 



(ti. 



II Pi(r\x) 



)eAz rG{r 



J 



n 



(Al) 



This expression defines the probability Pk(A z \x) to find k individuals when we are observing a set of sites A z , when the cluster 
of individuals is centered in a point x. This expression is valid without imposing any constraint on the set A z , but we want to 
interpret it as an area centered in a point of the space z, when the continuum limit will be performed. Thus we consider A z as a 
set of \A%\ sites distributed around the point z in such a way that this set converges in the continuum limit to a region A(z) with 
an area A centered in z. We are in principle not interested in the dependence on the location of the sample and on the location 
of the cluster center. Thus we have to average Pk(A z \x) over possible choices of x and z. We obtain the following expression 
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where p\ (r) = pi (r\0). 

Considering the definition of p\ (r), the average number of individual placed around a cluster center will be A = X) r eA P 1 to- 

We introduce a new quantity (f>(r) defined by the following relation pi(r) = \(/>(r). Note that <p(r) carries all the spatial 
information about p\ (r). 

To obtain the expressions in the continuum limit, we have to introduce a finite site spacing, define the scaling of the quantities 
respect to it and calculate the limit of vanishing site spacing. By performing this calculation in two dimensions we obtain 



P£(A\A ) = — d'z 

A JA a 



A:! 



exp(-A ' ^ 2 

lA(z) 



d r(f>(r) i . 



(A3) 



where Aq is the area of the whole landscape, A(z) is a region (e.g. a circle) centered in the point z and <fi(r) is the continuum 
limit of 4>(r). 

We would like to introduce in equation A4 our knowledge of the RSA S k (Aq) (see item|4]of our assumptions). The knowledge 
of the RSA gives us an information about the probability to find k individuals in the whole landscape (usually called Species 
Abundance Distribution, SAD): starting from the RSA we know that 



P k (A ) 



S k (A ) 
Stot(Ao) 



(A4) 



where Stot(Ao) is the total number of available species, which is given by X)^Lo S k (Ao) an d Pk(Ao) is the SAD. We want that 
the probability calculated with our model match the one obtained starting from the SAD when the whole landscape is considered. 
The expression calculated with our model in equation | A4| depends on a parameter A. We assume this parameter to be a random 
variable distributed in the interval (0, oo) accordingly to a probability distribution function p(X). This distribution p(X) will be 
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auto-consistently determined by imposing the matching between the model and the SAD when the whole system is considered. 
This procedure does not hide any particular ecological meaning, it is only a trick to perform the calculation and to impose the 
condition on the RSA. 

The probability obtained in equation A4 evaluated in an area A = Aq becomes a Poisson distribution with average A 

^o) = %e 

By introducing a distribution p(X) we obtain in the most general case 

A fe 



Pk(A ) 



dXp(X) — e := 

k\ btot(Ao) 



(A5) 



(A6) 



This expression defines p(X) in terms of the RSA. This equation is valid for k > 0, i.e. we are Stot counts even the species 
with zero abundance in the whole landscape (it is not a directly measurable quantity). In other words the probability P (A ) is 
generally different from zero. The total number of observable species (i.e. the species with at least one individual in the whole 
landscape) will be given by 



S(A ) = S tot (A )(l - / dX P (X)e- x ) 



(A7) 



By introducing p(X) in equation A4 we finally obtain 

Stot(Ao) 



S k (A\A ) := S tot {A Q )P k {A\A Q ) = 



and the number of species turns to be 
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Appendix B. DEPENDENCE ON p(X) AND <j)(r) 

Equation | A8 1 depends only on two functions: p(X) and cf>(r). These two functions are respectively related to the distribution 
of individuals in species and to the distribution of individuals in space. 

The probability distribution function p(X) is directly related to the Relative Species Abundance. The function <f>(r) was 
instead introduced as related to the probability that a site was or not occupied by one individual. Starting from the definition of 
the model, we observe that the two point correlation function is equal to 

G(r) = (X 2 ) f dPy<t>{y)<t>{y-r) , (A10) 
Jaq 

where (A 2 ) = dXX 2 p(X). By applying the Fourier transform, it is possible to invert this expression obtaining 

m = \[M- (aid 



This expression gives us a direct way to infer from data a form of the <fi(r) starting from the correlation function (which has the 
same functional dependence of the f3 -diversity). Note that, due to the normalization condition of (j)(r), it is sufficient to know 
the functional dependence of the correlation function (or of the (3 -diversity) to obtain the exact expression of 4>(r). An example 



of this calculation is shown in section Appendix F 



Appendix C. LIMIT A -> oo 



We are interested in calculating the following limit 



S(A) := lim S(A\A ) . (A12) 

Aq— tOC 
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In order to perform this limit, we have to know how Stot(Ao) scales with Aq when we consider the limit of large Aq. We affirm 
that the total number of species scale as 



Stot(A ) ~ stotA if A -> oo 



(A13) 



This scaling is not an assumption, instead it is a consequence of the fact that the total number of individuals scale with the area 
in the large area limit whereas the number of individual of a single species remains constant for sufficient large areas. Note that, 
due to equation A7 even S(Aq) follows a linear scaling: 



S(A ) ~ sA if A a -> 00 , 



where s is related to s to t via equation A7 



S = Stot 1 



(l - j dXp(X)e- x ) = Stot (l - P Q (A )) . 



By substituting the scaling of S(Aq) and the sum over k in equation A9 we obtain the following expression 

S{A) = s tot fd 2 z 



1 



d\p{X)e- X ^^) dr -^ 



(A14) 



(A15) 



(A16) 



which is our central result. 



Appendix D. DIMESIONAL ANALYSIS 



The equation | A 1 6 | depends at least on three parameters: the density of species s tot , the parameter of the RSA (there is at least 
a single parameter which appears in the distribution p(A)) and the correlation length £ (which appears in <f>(r)). The parameter 
Stot (which is not directly measurable, because it represent the density of available species) could by related to the density of 



observable species s by equation A15 It is possible to determine the functional form of the SAR, by using the dimensional 
analysis. The SAR is a function of A, which has the dimension of an area. The parameter s is a density and thus it has the 
dimension of an inverse of area, while £ is a length. The parameter s appears as a multiple of the entire expression leading to the 
dimensionalless result: 



S(A) = sAf^) 



(A17) 



The function / depends also on the dimensionless parameter appearing in the RSA. 

Note that if we did not consider the limit A — > 00 (i.e. we were interested in finite-size scaling) the SAR would also depend 
on the size of the system Aq and thus the functional dependence would be more complicate. 



Appendix E. EXPANSION OF SAR FOR SMALL AND LARGE AREAS 

The starting point to perform the expansions at small and large areas is the equation|5]of the main text: 



S(A) = s t ot 



1 - 



(A18) 



As written above, this equation depends at least on three parameters and has the form written in equation |A17| In order to 
calculate the limit of small or large areas we have to evaluate the previous expression for small or large ratios A/£ 2 . 



Small area expansion. When we consider small areas the integral f 



of equation A18 we obtain 



AM' 



(r) tends to A<ft(z). Expanding the exponential 



S(A) ~ stotA / dXXp(X) = stotA(X) = (p)A if A « £ 2 



(A19) 



Note that ( A) is equal to the average number of individuals per species (fc) (see equation A6 1, and thus s tot (A) is equal to the 
average density of individuals ^p). 
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Large area expansion. Consider the integral f A , z -, d D r_(f>(r) for large areas. We know that </>(r) is a function which decreases 
sufficiently rapidly for large areas, with a typical scale £. Thus for large areas the integral could be well approximated by the 
characteristic function xa(o) fe) (which is equal to 1 if z belongs to the region A(0) and it is zero otherwise). We obtain 



S(A) 



Slot 



d 2 z 



1 - 



d\p(\)e 



--VXA(O) (z) 



= s tot (l - P Q (A )) =sA ifA»£ 2 



(A20) 



where s is the density of the species we observe in the entire system and is related to s tot via equation A15 



Appendix F. A CHOICE FOR 4>(r) AND p(X) 



One of the most known form for the Relative Species Abundance is the Fisher log-series [35], which is defined as 



S k (A ) = e— ifk> 1, 



(A21) 



where 6 > and x £ (0, 1) are the two parameters of the distribution. The total number of observable species will be 

oo 

S(A ) = ^ 5 fe (A ) = -0 log(l - ar) . (A22) 



fc=i 



We have shown is section [Appendix C| that the number of observable species in the entire system scales linearly with Aq if Aq 
in sufficiently large. We assume that for large Aq, 9 ~ Aq whereas x does not depend on it. This assumption respects the 
requested scaling properties of S(Aq) and it is in agreement with the microscopic interpretation of the Fisher log-series (e.g. via 
birth-death process). We define 



lim — — 

A ->oo Aq 



(A23) 



The Fisher log-series was obtained, in the original derivation l35l . as an appropriate limit of a convolution between a Gamma 
distribution and a Poisson distribution 



S k (A ) = lim 5(e) 



dX- 



e -\/S X e-l X k e - 



r(c)5« 



A:! 



(A24) 



A24 



The parameter 8 is defined as the limit of 5(e) /T(e) for e — > 0, whereas x is defined as 5/(1 + 6). We can see that equation 
give us a recipe to choose the function p(X), because the RSA is exactly written in the same form of equation A6 Thus in order 
to impose the Fisher log-series as the RSA for the entire landscape, we have to choose p(X) as a appopriate limit of the Gamma 
distribution. 

We can obtain an explicit expression for the SAR by substituting our choice of p(X). For a finite area A we obtain 



S k (A\A Q ) 
where I(z, A) is defined as 



± ( * z lim 5(e) f^(M^ 



fc! 



I(A,z) 



A{z) 



(r)d 2 r 



(A25) 



(A26) 



Performing the integral in equation A25 taking the limit e-)0 and the limit Aq —> oo and summing over k from 1 to oo we 
finally obtain the following expression for the SAR 



S(A)=0 / dz\og 



(l-x(l- I(A,z}) 



1-x 



log 



■ l-x(l-I(A,z)) n 

1 — X i 



- log(l - X) 



(A27) 



Note that this expression when expanded for large and small area, follows the scaling obtained in section Appendix E as expected. 
To obtain a tractable expression we have also to specify a recipe for the function 4>(r). As demonstrated above this function 



could be related to the two point correlation function (or the /3-diversity) by equation A10 The two point empirical correlation 
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function could be for example fitted by a Bessel function Kq{t/£) |34| Following the procedure sketched in section Appendix 
[b] we can obtain a functional form for <p{r_) by calculating the Fourier transform of the two point correlation function, which for 
the choice of the Bessel function turns to be 

G( P ) cx — l — , (A28) 
- 1 + C p 

by taking its square root and by applying the Fourier anti-transform, we finally obtain 

e -|fcll/£ 

+® = » ' (A29) 



In this expression the proportionality constant was fixed by imposing the normalization condition J <fi(r)dr = 1 (see section Ap 



pendix B I 



Thus with the choice for <j){r) expressed above, the integral I(z, A) becomes 



e-lldl/e 

I(A,z)= I edlr-zH-^-p-— d 2 r, (A30) 



where we are considering a circular region A(z) with an area A = nR 2 



Appendix G. SCALES 



The tri-phasic SAR, as shown in figure[T[ seems to have two separate length scales A\ and A 2 . The first one separates the the 
linear trend at low scales with the power-law region, the second one is the boundary between the power-low intermediate region 
and the linear trend at large scales. We show in this section that our model give an expression for both the scales starting from 
only one length scale £ (the correlation length). 

Observing the figure [2] we can understand the mechanism which produces the observed pattern. The scale A 2 above which 
we obtain the linear scaling is the typical area occupied by a species: above it we have sampled the entire population of a single 
species. This scale depends only on £ and on the form of the correlation function (see Figure [3). 

The first scale A\ is determined by the typical minimum distance between two conspecific individuals (i.e. the average distance 
between one individual and the nearest conspecific): below this length scale, the sampled individuals belong to different species 
and thus the scaling is linear, above it the curve starts to bend down because we are sampling multiple individuals of the same 
species. This quantity could be well estimated from the RSA as the average of the reciprocal of the density (calculated in the 
area where the species live), which gives the typical area occupied by only one individual of a given species. Note that the 
distance between one individual and its nearest conspecific is well defined only if the species we are considering has at least 
two individuals. Let us pick an individual at random (chosen between the individuals belonging to the species with a population 
of at least two individuals). It will belong to a species with k individuals. The portion of area in which this individual is the 
only one belonging to its species will be well approximated by A 2 /k. Let us pick an individual, the probability that it belongs 
to a species with k individuals is proportional to kP k . We thus have to average this quantity with the probability to pick an 
individual of a species with a total number of individuals equal to k restricted to the condition to have at least two individuals 
(i.e. kP k /(J2 k>2 kP k )). We obtain 



OC 



v A 2 kS k (A ) E k >2Sk(A ) 

1 h k sr= 2 ks *( A o) z k > 2 ks k (A ) 2 - 1 } 



If this expression is evaluated for the choice of the Fisher Log-Series, it becomes 



A 1 = h{x)A 2 = (1 - x) X y X) A 2 . (A32) 



Appendix H. ENDEMIC AREA RELATIONSHIP 

In this section, following the same procedure used to calculate the SAR, we obtain an expression for the Endemic Area 
Relationship (EAR) in a large homogeneous system. The EAR is defined as the average number of species whose population is 
completely contained in an area A. 
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Given a system of area Aq, the number of endemic species in an area A will be equal to the number of species, with at least 
one individual in Ao, which do not have an individual outside A (i.e. in the area Aq\A). We obtain a relation between the SAR 
and the endemic area relationship E(A) 

E(A\A ) = S(A ) - S(A \A\A ) = Sto j Ao) E(A) = 

(A3 3) 

[ d?z r d\p(\)e~ x [e A Sauo d ^ - 1] , 

Aq J An JO 



which, in the continuous limit, becomes equation 13 of the main text. 

By using the same arguments used for the distribution of the number of species, it is possible to demonstrate that the probability 
to find k endemic species in an area A is a Poisson distribution of average E(A), i.e. 

P k E (A) = exp(-E(A)) . (A34) 



Acknowledgement 

J.G. thanks B. Bassetti, M. Cosentino Lagomarsino and A. Sanzeni for many useful discussions. S.A. was supported by the 
EU FP7 SCALES project ("Securing the Conservation of biodiversity across Administrative Levels and spatial, temporal and 
Ecological Scales"; project No. 26852). A.M. thanks Cariparo foundation for financial support. We thank S.J. Cornell and W.E. 
Kunin for insightful discussions. 



[1] Olof Arrhenius. Species and area. The Journal of Ecology, 9:95 - 99, 1921. 

[2] R. H. Macarthur and E. O. Wilson. The Theory of Island Biodiversity . Princeton University Press, Princeton, N.J., 1967. 
[3] Robert M May. Island biogeography and the design of wildlife preserves. Nature, 254(5497): 177 - 178, 1975. 

[4] M. Williamson. Relationship of species number to area, distance and other variables, pages 91-115. Chapman and Hall, London, 1988. 
[5] FL He and P Legendre. On species-area relations. American Naturalist, 148(4):7 19-737, OCT 1996. 

[6] David Storch, Arnost L. Sizling, and Kevin Gaston. Scaling species richness and distribution: Uniting the species-area and species- 
energy relationships. Cambridge University Press, Cambridge, 2007. 
[7] David Storch, Arnost L Sizling, Jiri Reif, Jitka Polechova, Eva Sizlingova, and Kevin J Gaston. The quest for a null model for macroe- 

cological patterns: geometry of species distributions at multiple spatial scales. Ecology letters, 1 1(8):771— 84, August 2008. 
[8] Hector Garcia Martin and Nigel Goldenfeld. On the origin and robustness of power-law species-area relationships in ecology. Proceedings 

of the National Academy of Sciences of the United States of America, 103(27):10310-5, July 2006. 
[9] John Harte, Adam B Smith, and David Storch. Biodiversity scales from plots to biomes with a universal species-area curve. Ecology 
letters, 12(8):789-97, August 2009. 
[10] Stina Drakare, Jack J Lennon, and Helmut Hillebrand. The imprint of the geographical, evolutionary and ecological context on species- 
area relationships. Ecology letters, 9(2):215-27, February 2006. 
[11] Stephen P. Hubbell. The Unified Neutral Theory of Biodiversity and Biogeography . Princeton University Press, 2001. 
[12] Jerome Chave. A Spatially Explicit Neutral Model of /3-Diversity in Tropical Forests. Theoretical Population Biology, 62(2): 153-168, 
September 2002. 

[13] Igor Volkov, Jayanth R. Banavar, Stephen P. Hubbell, and Amos Maritan. Neutral theory and relative species abundance in ecology. 

Nature, 424(6952): 1035-7, August 2003. 
[14] Jerome Chave. Neutral theory and community ecology. Ecology Letters, 7(3):241-253, February 2004. 

[15] Tommaso Zillio, Igor Volkov, Jayanth R. Banavar, Stephen P. Hubbell, and Amos Maritan. Spatial Scaling in Model Plant Communities. 

Physical Review Letters, 95(9): 1-4, August 2005. 
[16] David Alonso, Rampal S Etienne, and Alan J McKane. The merits of neutral theory. Trends in ecology & evolution, 21 (8):45 1—7, August 

2006. 

[17] Sandro Azaele, Simone Pigolotti, Jayanth R Banavar, and Amos Maritan. Dynamical evolution of ecosystems. Nature, 444(7121):926-8, 
December 2006. 

[18] Igor Volkov, Jayanth R. Banavar, Stephen P. Hubbell, and Amos Maritan. Patterns of relative species abundance in rainforests and coral 

reefs. Nature, 450(7 166):45-9, November 2007. 
[19] James Rosindell and Stephen J Cornell. Species-area relationships from a spatially explicit neutral model in an infinite landscape. Ecology 

letters, 10(7):586-95, July 2007. 

[20] Tommaso Zillio, Jayanth R Banavar, Jessica L Green, John Harte, and Amos Maritan. Incipient criticality in ecological communities. 

Proceedings of the National Academy of Sciences of the United States of America, 105(48): 1871 4-7 , December 2008 . 
[21] N.G. Van Kampen. Stochastic Processes in Physics and Chemistry. North Holland, 1981. 



16 



[22] R K P Zia and B Schmittmann. Probability currents as principal characteristics in the statistical mechanics of non-equilibrium steady 
states. Journal of Statistical Mechanics: Theory and Experiment , 2007(07):P07012-P07012, July 2007. 

[23] James P O'Dwyer and Jessica L Green. Field theory for biogeography: a spatially explicit model for predicting patterns of biodiversity. 
Ecology letters, 13(l):87-95, January 2010. 

[24] Jacopo Grilli, Sandro Azaele, Jayanth Banavar, and Amos Maritan. Lack of detailed balance in a spatial explicit neutral model. Submitted 
for publication. 

[25] Noel Cressie. Statistics for Spatial Data (Wiley Series in Probability and Statistics). Wiley-Interscience, 1993. 

[26] Janine Illian, Jesper M0ller, and Rasmus Waagepetersen. Hierarchical spatial point process analysis for a plant community with high 
biodiversity. Environmental and Ecological Statistics, 16:389-405, 2009. 10.1007/sl0651-007-0070-8. 

[27] Marjorie Thomas. A generalization of poisson's binomial limit for use in ecology. Biometrika, 36(l/2):pp. 18-25, 1949. 

[28] Jerzy Neyman and Elizabeth L. Scott. Statistical approach to problems of cosmology. Journal of the Royal Statistical Society. Series B 
(Methodological), 20(l):pp. 1-43, 1958. 

[29] Peter J. Diggle. Statistical Analysis of Spatial Point Patterns (Mathematics in Biology). Academic Pr, 1984. 

[30] J B Plotkin, M D Potts, N Leslie, N Manokaran, J Lafrankie, and Peter S Ashton. Species-area curves, spatial aggregation, and habitat 

specialization in tropical forests. Journal of theoretical biology, 207(l):81-99, November 2000. 
[31] Helene Morion, George Chuyong, Richard Condit, Stephen P. Hubbell, David Kenfack, Duncan Thomas, Renato Valencia, and Jessica L 

Green. A general framework for the distance-decay of similarity in ecological communities. Ecology letters, 1 1(9):904-17, September 

2008. 

[32] Sandro Azaele, Stephen J Cornell, and William E Kunin. Downscaling species occupancy from coarse spatial scales. Ecological 

Apllications, 22(3):1004-1014, 2012. 
[33] Bernard D. Coleman. On random placement and species-area relations. Mathematical Biosciences, 54(3-4): 191 - 215, 1981. 
[34] Richard Condit, Nigel Pitman, Egbert G Leigh, Jerome Chave, John Terborgh, Robin B. Foster, Percy Nunez, Salomon Aguilar, Renato 

Valencia, Gorky Villa, Helene C Muller-Landau, Elizabeth Losos, and Stephen P. Hubbell. Beta-diversity in tropical forest trees. Science 

(New York, N. Y. ), 295(5555):666-9, January 2002. 
[35] R. A. Fisher, A Steven Corbet, and C B Williams. The Relation Between the Number of Species and the Number of Individuals in a 

Random Sample of an Animal Population. The Journal of Animal Ecology, 12(1):42, May 1943. 
[36] Fangliang He and Stephen P. Hubbell. Species-area relationships always overestimate extinction rates from habitat loss. Nature, 

473(7347):368-71, May 2011. 

[37] M. A. M. de Aguiar, M. Baranger, E M Baptestini, L. Kaufman, and Y. Bar- Yam. Global patterns of speciation and diversity. Nature, 
460(7253):384-7, July 2009. 

[38] WE Kunin. Sample shape, spatial scale and species counts: Implications for reserve design. Biological Conservation, 82(3):369-377, 
DEC 1997. 



